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Abstract. In this paper, we introduce a geometric statistic called the sprawl of a group with respect to 
a generating set, based on the average distance in the word metric between pairs of words of equal length. 
The sprawl quantifies a certain obstruction to hyperbolicity. Group presentations with maximum sprawl 
(i.e., without this obstruction) are called statistically hyperbolic. We first relate sprawl to curvature and 
show that nonelementary hyperbolic groups are statistically hyperbolic, then give some results for products 
and for certain solvable groups. In free abelian groups, the word metrics asymptotically approach norms 
induced by convex polytopes, causing the study of sprawl to reduce to a problem in convex geometry. We 
present an algorithm that computes sprawl exactly for any generating set, thus quantifying the failure of 
various presentations of Z"* to be hyperbolic. This leads to a conjecture about the extreme values, with a 
connection to the classic Mahler conjecture. 



We will define and study a new geometric statistic for groups in this paper, called the sprawl of a group 
(with respect to a generating set). Sprawl measures the average distance between pairs of points on the 
spheres in the word metric, normalized by the radius, as the spheres get large. This gives a numerical 
measure of the asymptotic shape of spheres that can be studied for arbitrary finitely generated groups and 
locally finite graphs. 

To be precise, let 



provided this limit exists. Note that since < d{x, y) < 2n, the value is always between and 2. By way of 
interpretation, note that E = 2 means that one can almost always pass through the origin when traveling 
between any two points on the sphere without taking a significant detour. (The name is intended to invoke 
urban sprawl: a higher value means a lack of significant shortcuts between points on the periphery of the 



As we will see, this statistic is not quasi-isometry invariant but nonetheless captures interesting features 
of the large-scale geometry, to be developed in ^ Sprawl has connections to other geometric statistics such 
as divergence, almost-convexity, and discrete Ricci curvature. After explaining why this statistic detects 
curvature properties, we show below that non-elementary hyperbolic groups always have E{G, S) — 2 for 
any generating set, so we can think of 2 — £' as quantifying an obstruction to hyperbolicity in groups. We give 
some results about sprawl for non-hyperbolic groups, including product groups and some solvable examples 
(lamplighter groups). 

Free abelian groups and convex geometry. For free abelian groups, there are particularly clear results 
on the asymptotic shape of spheres and the distribution of their points that allow us to compute the sprawl. 
As we will review below in the word metrics on 'Z'^ are close at large scale to certain norms, and the points 
of the spheres are distributed in a way that tends to a limit measure on the unit sphere in the norm. This 
allows us to replace an asymptotic computation on large discrete spheres by a finite computation: integrating 
average distance on a polytope against an appropriate measure. We give an algorithm for performing this 
calculation for arbitrary (Z'^, S) in i j4.1l Though we can compute sprawl exactly for any finite presentation 
of Z'^, it is still an interesting problem to find the extremal values over all generating sets. That is, we are 
studying a group statistic that depends on the choice of generators, but how much can it vary? This becomes 
a (possibly hard) problem in convex geometry, which we will study below. 



1. Introduction 
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Definition 1. A convex body is a convex set in M.'^ with interior. A perimeter is the boundary of a centrally 
symmetric convex body in W^. 

(To emphasize this point: we are using the word "perimeter" in a special way, which includes the assumption 
of central symmetry. Accordingly, we assume that our generating sets S for Z'' are symmetric, so that 

We will show that a generating set S for Z** induces a perimeter L in a very simple way {L is just 
the boundary of the convex hull of S in R'') and that the sprawl E{'Z'^,S) — E{L) depends only on L. 
Furthermore E[L) = E{TL) for linear transformations T, so sprawl gives an affine geometric invariant: 
average distance between two points on the perimeter, where both the distance and the measure have 
natural intrinsic definitions with respect to the shape. We conjecture that the cube and the sphere are 
the extreme shapes in every dimension, and we give some rigorous evidence for that in [j4][5l This would 
mean, for example, that over all generating sets for 1? , the values achieved by sprawl are pinched between 
4/7r 1.273 and 4/3 ~ 1.333. This extremization problem resembles the well-known Mahler conjecture in 
convex geometry, a parallel developed in the last section below. 

Acknowledgments. We thank Alex Eskin, Ralf Spatzier, and Greg Kuperberg. The first author is partially 
supported by NSF grant DMS-0906086, the second author is partially supported by ANR grants 06-BLAN- 
0038 and Project Modunombres, and the third author is partially supported by NSF grant RTG-0602191. 

2. Hyperbolic groups and statistically hyperbolic groups 

In a graph, let us adopt the convention that for a real number r > 0, the notation Sr denotes the metric 
sphere of radius [rj . We will study the Cayley graph as the metric model of a group, adopting the convention 
that the points of our metric space are the vertices (that is, the elements of the group), endowed with the 
distance induced by the edges (the word metric). We will write /3(r) := ^Br{e) to denote the number of 
group elements in the closed ball of radius r about the identity (or by translation-invariance, about any 
other center) in the group. 

2.1. Hyperbolicity. A metric space is called 5 -hyperbolic (or just hyperbolic, without specifying a value 5) 
if every geodesic triangle has the property that each side is contained in the ^-neighborhood of the union 
of the other two sides. In such a space, suppose two geodesic rays share a common endpoint. Then if they 
become separated by 25 at time t^, they must subsequently diverge completely: the two subrays after this 
separation can be concatenated to form a complete quasigeodesic, because for t>to, any geodesic segment 
connecting 71 (i) and 72 (i) must return to a 25- neighbor hood of 7i(io)- This means that the distance between 
71 (i) and 72(0 is at least 2{t — — 5). Since we have strong estimates on the distance after the rays stop 
fellow-traveling, our task for hyperbolic groups will be to get quantitative control of the fellow-traveling. 

To illustrate the issues involved in finding the sprawl of a group, first consider the free (nonabelian) group 
F2 with its standard generating set. (Here and from now on, std will denote the standard generating set for 
a group) . The Cayley graph is a 4-regular tree, and to evaluate the average on the sphere directly, one forms 
a finite sum by fixing one point on the sphere and then counting the other points of Sn at various distances 
from the first: 

V d{x,y) = -(2n)-Ki-(2n-2) + ii-(2n-4) + iii-(2n-6) + ---+0. 

As n — >■ c», this can be evaluated using a geometric sum, and one computes in this way that i?(i^2, std) = 2. 
This argument, however, is sensitive to the choice of generating set. What would happen for some other 
generating set? Does the 5-hyperbolicity of the model space suffice? The answer is "No" in general. One can 
easily construct trees with sprawl any number between and 2, trees where sprawl does not exist, and trees 
where sprawl depends on basepoint. These trees are highly nonhomogeneous and are not quasi- isometric to 
any group. 

Remark 2 (Sprawl and classical curvature). Moving beyond locally finite graphs and groups, we can define 
the sprawl for metric spaces that have natural measures on spheres. Instead of counting measure one may 
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take HausdorfF measure in the appropriate dimension, for example. Thus for a space and measure {X,fi), 
we can write 



One can quickly show that the hyperbolic plane (and thus hyperbolic space of any dimension) has E ^ 2: 
for two rays making angle 9 at their common basepoint, d(7i(i), 72(i)) > 2t — c{9), where c{9) is a constant 
depending on 9. 

Indeed, it is not hard to identify a relationship between sprawl and curvature: if Er is defined to be the 
average distance between pairs of points on Sr and is the model space of constant sectional curvature 
K, it is easily observed that for every fixed value of r, the values Eri^M^) are strictly decreasing in k (taking 
K < TT^r^ so that Sr is non-empty). 

However, a (5-hyperbolic space, indeed even a tree, need not have E — 2, and exponential growth of 
balls or spheres does not suffice. For instance, consider modifying the four-regular tree by choosing one axis 
and modifying the degree at each vertex in that axis as a function of distance from the origin. Examples 
constructed in this way can achieve all values < < 2, and can also have E not exist or depend on 
basepoint. Thus to prove that hyperbolic groups have maximal sprawl, it is essential to make use of the 
homogeneity guaranteed by a transitive group action. We will use this by appealing to a strong result of 
Michel Coornaert giving definite exponential growth (not just a growth rate but furthermore a bound on 
the coefficients) for hyperbolic groups. 

Remark 3 (Divergence, almost-convexity, discrete Ricci curvature). Recall that sprawl is measured by 
computing the distances between pairs of points x,y S'„, then taking the average and letting n — ^ oo. At 
least three other geometric statistics also study the geometry of pairs of points in the sphere. 

• Divergence is measured by minimizing the length of paths between x,y € Sn such that the path 
lies outside of i?„, then taking the sup and letting n — 7> oo. This is widely studied for groups, for 
instance in [m HOI [Ml E] . 

• Almost-convexity for groups is measured by minimizing the length of paths between x,y Sn such 
that the path lies inside of i3„. This was defined by Cannon in [3] and further explored in many 
papers, such as [T^ 151 H]. 

• Ricci curvature for manifolds is defined by considering infinitesimal spheres at a pair of basepoints, 
and measuring the average distances between corresponding points on the spheres. If that distance 
is greater than the distance between basepoints, then the curvature is negative; if smaller, then the 
curvature is positive; and if equal, then the curvature is zero. Discrete Ricci curvature mimics this 
construction in a manner usable for groups by measuring distances between corresponding points in 
metric spheres at different basepoints. This was defined by Yann Ollivier in fTB^ and compared to 
optimal transport definitions of Cedric Villani and coauthors in |17j . 

Thus the definition of sprawl gives it a family resemblance to other synthetic curvature conditions that have 
already proved useful. 

Recall that a hyperbolic group is called elementary if it is finite or has a finite-index cyclic subgroup. 

Theorem 4. Let G be a non- elementary hyperbolic group. Then E{G, S) — 2 for any finite generating set 
S. (That is, every presentation is statistically hyperbolic.) 

Proof. Recall that z is said to be (metrically) between x and y if d(x, z) + d(z, y) = d{x, y). A set is between 
two other sets if there exists a triple of points, one from each of the sets, satisfying the betweenness condition. 

Choose any < p < 1 and x G S'„, and let x' be an arbitrary point on 5p„ between e and x. We need to 
bound the number of w e S'„ such that B2s{x') is between e and w. But if w' is a point in 5p„ between e 
and w, then d{w' , w) = n— \_pn\ . That means that the number of such w is overcounted by I-B215I • \Sn-ypn\ I- 

For every point v of Sn which is not of this kind, d{x, v) > 2{n — \_pn\ — 5) > 2{n — pn — 6) because the 
geodesies from the identity to x and to v have 25-diverged by time lpn\ . Thus, 





x,y€S, 
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Now we make use of the homogeneity. Coornaert proved in [S] that for every non-elementary hyperbohc 
group with fixed generating set, there are bounded coefficients of exponential growth: 



(t) 



3ci, C2 > 0, w > 1 



s.t. 



n 



ft follows from these inequalities that 



\Sn~lpn\ I _ /3(rt - LpnJ) - I3{n- [pn\ - 1) 
\Sn\ " - /3(n - f ) 



^ 



as n — ?> oo, which together with Q gives us 



E{G, S) 



hm 



>2(l-p). 



Since < p < f was arbitrary, this means E — 2. 



□ 



To quickly clarify the necessity for the non-elementary hypothesis: for G — Z and any finite generating set, 
the spheres of large radius are divided into a positive part and a negative part, each of uniformly bounded 
diameter. Thus a pair of points has bounded distance with probability 1/2 and distance boundedly close to 
2n with probability 1/2. This gives E{Z, S) = 1 for all finite generating sets S. 

2.2. Some statistically hyperbolic groups and spaces. Here we exhibit several examples of non- 
hyperbolic groups with statistically hyperbolic presentations. We first consider groups that are direct prod- 
ucts with a hyperbolic factor, and then use the results on products to consider Diestel-Leader graphs. 

Let us say that a based space (H, ho) has definite exponential growth if the growth function /3{n) of balls 
of radius k centered at h^ in H satisfies ([f]) . Given a sequence of finite sets An , we will say that almost all 
points of An satisfy a property (P), or that the property has full measure, if the subset of elements satisfying 
(P) has proportion tending to one as n — > cxi. 

The subtlety in analyzing products is that the sphere of radius n projects to not a sphere but to a ball 
in each factor. Thus we need estimates for distances when points are on spheres of different radii; we can 
use definite exponential growth in one factor to get control on the difference in radius (so that most of the 
projection is in an annulus An), and then use hyperbolicity to get the distance estimates. We also need to 
know that spheres in these annuli are evenly covered by which we mean that there is a function /„ : N — > N 
such that # (7r-i(/i) n S^) = /„(|/i|ff) for almost all h e A^ . 

In the following technical lemma, the reader should imagine that H is a direct factor of X and that 
TT : X — > _ff is coordinate projection. Recall that a semi- contraction is a distance non-increasing map. 

Lemma 5 (Annulus lemma). Let {X, xq) and {H, ho) be based graphs, suppose H is 5 -hyperbolic with definite 
exponential growth, and fix any < p < 1. Consider the annulus An = Bn {ho) \ B^n-i{f^o) in H and the 
sphere Sn = Sn {xo) in X. Let tt : {X,xo) — {H,ho) be a semi- contraction, mapping almost all points of 
Sn into An such that spheres in An are evenly covered. Then 



This is proved by showing that when pn <i,j< n, then the average distance between a point in and 
a point in Sf is bounded below by i + j — 2(5 — 2pn — cnuj^P"^ for a constant c, where uj is the growth rate 
of H, as in 

We will apply this lemma to products of the form H x K where H is hyperbolic and K grows strictly 
slower — that is, K has subexponential growth, or has a smaller exponential growth rate. Let us say that a 
generating set for a product is split if every generator projects to the identity in one of the factors. 

Proposition 6 (Products with a dominated factor). Suppose that H is a non- elementary hyperbolic group, 
K is finitely generated, and S is a split finite generating set for H x K such that the growth function of H 
dominates the growth function of K with generators projected to the factors from S . Then (H x K, S) is a 
statistically hyperbolic presentation. 
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Proof. Let TT be projection to the H factor from X = H x K and note that 

k 

^k = [_J ^ ^k~i- 
i=0 

Thus one easily verifies the hypotheses of the annulus lemma. Letting p ^ 1 gives E{X, S) = 2. □ 

Another class of statistically hyperbolic spaces is the Diestel-Leader graphs. We describe them briefly 
here and refer the reader to [18^ for a more thorough treatment and some relevant properties. For m, n > 2, 
take an (m + l)-valent tree Ti and an (n + l)-valent tree T2. Choose ends and corresponding horofunctions 
/i and f2- This gives height functions hi ~ fi and /12 = on the trees. We visualize Ti as "growing 
up" from its end at height — cxd and T2 as "hanging down" from its end at +00. The Diestel-Leader graph 
DL{m, n) is defined to be the subspace of Ti x T2 on which hi = h2. A height function h is induced on this 
graph from the tree factors, since their height functions match. Like Cayley graphs, Diestel-Leader graphs 
have vertex-transitive group actions by isometrics, which guarantees that geometric invariants of DL{m,n) 
do not depend on the choice of basepoint. 

These graphs are considered models for solvable geometry: the structure described above is in precise 
analogy with the geometry of Sol, which has hyperbolic plane factors in the place of trees. Eskin, Fisher, 
and Whyte [9] exploit this analogy to completely classify Diestel-Leader graphs and spaces with Sol geometry 
up to quasi-isometry. Furthermore, for m > 2, DL{ni, m) can be realized as Cayley graphs of solvable groups, 
namely the lamplighter groups F } Z where is a finite group of order m. 

Denote the coordinate projections by TTi : DL{m, n) Ti for z = 1, 2. A geodesic 7 in DL{m, n) is said to 
turn if it switches from increasing in height to decreasing in height or vice versa. Geodesies in Diestel-Leader 
graphs have at most two turns. The following lemma tells us that spheres of large radius in a Diestel-Leader 
graph are "concentrated in distant heights." 

Lemma 7 (Concentration in height). Let X = DL{m,n) be a Diestel-Leader graph, xq (z X be a basepoint 
at height 0, and < p < 1. Denote by Sk the sphere of radius k in X centered at Xg. For almost all x Cz S^, 
pk < \h(x)\ < k. If m > n, then for almost all x S S}^, pk < h{x) < k. 

Proof. Assume m > n and consider the problem of counting fc-tuples {xi,X2, ■ ■ . ,Xk) of vertices of X such 
that each pair {xi,Xi^i) bounds an edge and the concatenation of these edges forms a geodesic in X. If 
we start by choosing xi to be immediately above xq, then we have m choices, since there are m vertices of 
Ti immediately above 7ri(xo) and only one vertex of T2 above 7r2(a;o). If X2 is chosen immediately above 
Xi, then there are choices for the pair {xi,X2). In general, there are m* ways to choose a tuple of 
vertices {xi, . . . ,Xi) such that Xj+i is immediately above Xj. Suppose we now choose x^+i below Xi. Then 
TTi(xi+i) = TTi(xi-i), which means that we have lost one of our choices for a vertex in Ti. This choice is 
replaced by the choice of a vertex 7r2(a;i+i) immediately below 7r2(xi) other than 7r2(xi_i). There are n — 1 
such possibilities. If we now continue choosing vertices to be decreasing in height, then we continue replacing 
factors of m* with factors k < m. So turns in a geodesic reduce the number of choices and geodesies continue 
in the same direction for as long as they can before turning. If m = n, then the same argument applies if 
we begin choosing xi immediately below xq, so geodesies tend to end in heights which are distant in either 
the positive or the negative direction. If to > n, then the above argument shows that geodesies in fact tend 
to end in high (positive) heights. □ 

Theorem 8 (Diestel-Leader graphs). For any m,n>2, the Diestel-Leader graph X = DL{m,n) is statis- 
tically hyperbolic. 

Proof. Let Sk denote the sphere of radius k centered at a point xq of height zero and let p be fixed. We will 
begin by considering the case where m > n. tti : X — >■ Ti is a semi-contraction, since tti takes paths in X to 
paths in Ti while preserving their length. By the previous lemma, a full-measure subset oi Sk lies above 
height pk. A similar argument can be used to show that the same is true for Ti, and that a full- measure 
subset Vk of the annulus Ak = Bk \ Bpk centered at 7ri(a;o) lies above height pk. In fact, Vk — ni{Uk). In 
order to apply the Annulus Lemma, we just need show the even covering condition. Suppose y (z Vk with 
d — d{y,Tri(xQ)), and let 7 be a geodesic in X of length k starting at xq and ending at a point of Tr~^{y). 
Then 7ri7|[o is a path from 7ri(a;o) to y. Such a path may initially decrease in height, and so choices are 
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made in the T2 coordinate. But since y is above height zero, 7 must then come back up and any choices 
made in T2 for the initial portion of 7 will have no effect on where 7 ends. The only significant choices 
in T2 for 7 occur after 7ri7 passes y and turns around again. This final downward portion of 7 has length 
(fc — d)/2. So k and d must have the same parity and the number of points in the preimage of y is a function 
of d. Thus we apply the annulus lemma to get E{X) = 2. 

If m = n, then above argument shows that the average distance between a pair of points in Sk above 
height is close to 2k. By symmetry, it follows that the average distance between a pair of points in Sk 
below height is also close to 2k. But a significant proportion of pairs x,y G Sk will have the property that 
h{x) > and h{y) < 0. By the previous lemma we may again assume that h(x) is close to k and that h{y) 
is close to — fc. So the difference in heights, a lower bound on distance, is close to 2k. □ 

When TO — n, the Diestel-Leader graph DL{m, to) can be realized as the Cayley graph of the lamplighter 
group Z,„ ; Z for a certain natural generating set (or, more generally, F I Z for any finite group F of order 
to). These interesting solvable groups are not nilpotent and they are not finitely presented. 

Corollary 9 (Lamplighter groups). The lamplighter groups Z„i?Z have statistically hyperbolic presentations. 

Finally, besides Euclidean space itself, the symmetric spaces of noncompact type also have E = 2 (essen- 
tially because of the probability tending to zero that pairs of points lie in a common flat). We know of no 
examples of groups of exponential growth with E < 2, but because of the facts above it would be natural to 
expect that groups of non-uniform exponential growth need not have E = 2. 

3. Reducing from free abelian groups to convex geometry 

In the free abelian groups Z'*, studying the large-scale metric geometry is greatly aided by the natural 
embedding in R'^. It is known that the finite word metrics on Z'^ are asymptotic to norms on (originally 
due to Burago [2], and shown by an elementary geometric argument in [6]), so that these norms can be 
thought of as limit metrics coming from group theory. Recall that any convex, centrally symmetric body in 
R'^ induces a Minkowski norm, namely the norm for which that convex body is the unit ball. If a generating 
set for Z'* is called S, let |w| denote the length of w G Z'^ in the word metric, and let L be the boundary of 
the convex hull of S in R'^. Then the Minkowski norm \\ • ||^ having L as its unit sphere is the limit metric, 
in the sense that there is a constant K depending on S such that 

||w||l < |w| <M\l + K 

for all w e Z'^. This limit shape L describes the asymptotic shape of spheres in the sense that ^Sn L (say 
as a Gromov-Hausdorff limit). 

In an earlier paper, we proved counting results for spheres in word metrics on Z'', showing that counting 
measure on the discrete spheres Sn converges to the cone measure on L, as pictured in Figure [TJ The 
case of that theorem that is useful for us here states that 

Tq~i2 E -^(X'y)= / llx-ylli '^AiL^(x.y)• 
n^oo |5'„|^ ^^^^ n 7^2 

(The original theorem addresses more general averaging problems.) Thus it follows immediately that 

i?(Z^5)= / \\x~y\\^df,^\x,y) 

for all finite generating sets S. 

We can define the sprawl of any perimeter L by the right-hand side, which we can denote by E{L), 
measuring average distance between points of L as measured in its intrinsic geometry. We remark that 
E{L) = E{TL) for any linear transformation T : ^ W^, since both the norm and the measure push 
forward under linear transformation. That is, ||rx— TyWj,-^ = ||x — y||^, and dfj,rp^{Tx) = d^^{x). 

One immediate consequence of the reduction to convex geometry is that £'(Z'^, S) is always greater than 
1/2. 

Proposition 10. E{L) > ^ for all perimeters L in R'^ . 
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Figure 1. Six arcs are shown in red in this figure, each having cone measure 1/14; in other 
words, all of the colored regions have 1/14 as much area as the convex body they are in. 
In the square and the hexagon, all sides have equal measure. On the other hand, for this 
octagon generated by the chess-knight moves {(±2,±1), (±1,±2)}, the measure of its two 
types of sides (shown with green and blue) is in the ratio 4:3. Cone measure is defined on 
any perimeter, and in particular it is uniform on the circle. 

Proof. Fix an arbitrary point x G L, and denote by Q the convex body of which L is boundary. The points 
of L whose distance from x is less than one are those contained in Q + a;, the translated copy of Q centered 
at X. Since L + x contains and Q is convex, there is a hyperplane P through which does not intersect 
the interior of Q + x. So the interior of Q + x is on one side of P, and by central symmetry, half of the 
cone measure lies on each side of P. Thus the average distance on L from 2: is > (1/2)(1). To obtain the 
strict inequality, just note that the distance from x to —x is always 2 and so a small neighborhood of ~x 
contributes an amount near 2 to the average. □ 

4. Sprawl in the plane 

From the work above, we have reduced the group calculation E{'L'^ , S) to the convex geometry calculation 
E{L). In this section we study this convex geometry in dimension 2, by first introducing an algorithm for 
evaluating E{L). This algorithm can be given to a computer (which we did, producing a great deal of 
experimental evidence for the conjectures to follow) but can also be used to produce precise formulas, such 
as those given below for the regular polygons. 

4.1. Cutline algorithm. To compute the sprawl of a polygon, we can average the expected distances 
between pairs of sides. Pick two sides a and t of L and parametrize each of them (say clockwise) by [0, 1]; 
then the distance in the L-norm from a(s) to T(t) is piecewise linear. Thus for an appropriate triangulation 
of the parameter space, average-distance is a linear function on each triangle. We outline here a method for 
triangulating, which we call the cutline algorithm for computing the sprawl of a polygon. We note that the 
algorithm generalizes straightforwardly to higher dimensions. 

Fix a and t. Find the sector of angles at which the sides "see" each other — that is, the interval of 
arguments obtained by vectors from cr{s) to r(i) — as in the first picture in Figure [21 Considering the same 
sector of angles viewed from the origin, as in the second picture, mark the angles that point in vertex 
directions in this sector (shown as a dashed line). 




r 



Figure 2. A depiction of the algorithm for finding the average distance between sides a and r. 

For each vertex direction 6, consider the line T C [0, 1] x [0, 1] of times at which the vector between the 
sides points in the vertex direction; the corresponding chords form a trapezoid as in the third picture. For 
each trapezoid, record the lengths of its bases, marked in the figure as Iq and h. (In general, for direction 
di, these are the largest and smallest values of d{a{s),T{t)) for {s,t) G Ti, and can be denoted Z^q and In.) 
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Let doo, doi, dio, dn be the four distances between an endpoint of a and an endpoint of r (measured in the 
L-norm), with dij = d{a{i),T{j)). 

Next, consider the unit square formed by the parameters [0, 1] x [0, 1]. All the distances between points 
on the two chosen sides of the polygon can be recorded by a real-valued function on this square. To find 
the average distance between sides a and r, we only need to integrate that function over the square (using 
Lebesgue measure because the cone measure is proportional to arclength on each side; the proper weights 
will be restored below). Since the function is piecewise linear, it will suffice to know its values at the points 
of a triangulation that is fine enough that the function is linear on each triangle. 

For each vertex direction 9i, the corresponding times Ti cut out a straight segment across the square, which 
we will call a cutline. The values at the corners of the square are the dij and the values at the endpoints 
of the cutlines are the lij. If the outlines do not triangulate the square, add dummy outlines as needed 
(between these same points, so requiring no further distance calculations) to complete a triangulation. One 
such dummy cutline is shown in the figure. 

Now the average distance between a point on side a and a point on side r can be read off of this parameter 
square by just knowing the values at the vertices of the triangles: for each triangle, average the values at its 
vertices, and then sum those averages over all the triangles, weighted by the areas of the triangles. Thus let 
Eij denote the average distance between ai and aj. Let Wi be the weight of the ith side in the cone measure: 
Wi = iij^{ai). Then, finally, the average distance between all pairs of points on the polygon can be written 
as the weighted average: 



4.2. Values. By applying the cutline algorithm, we find formulas for the sprawls of regular polygons. 
We note that the regular hexagon is equivalent by linear transformation to the hexagon with vertices 
±(1, 0), ±(1, 1), ±(0, 1), which is the limit set for the generating set S = ±{ei, 62,61 + 62}. For regular 
polygons with at least 8 sides, however, they are not exactly realized by word metrics on Z^. 

Proposition 11. Let Pk he the regular k-gon and let he the round unit circle. Then 
E{I?,±{ei,e2})^E{Pi) = |; 
S(Z2,±{ei,e2,6i +62}) =-B(P6) = i<|; 

E{Ps) - <Y^' 



E{P,) 



EiS') 



4 / tt/x ^ 1 
TT \tan(7r/x) 



iftan(f)), xem, 



^^"^ ksin(f)), xe4N + 2, 



TT \sin(7r/x) 6 
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Proposition shown in Figure |3] below, shows of the nontrivial dependence of sprawl on the choice 
of generating set. Since the word metrics of a group G with respect to finite generating sets S, S' are 
quasi-isometric, we see that sprawl is not a quasi-isometry invariant. 

To prove the formula for regular polygons, one can set aj for the average distance from (Ti to aj and 
re-express that using the chordlengths £i — (i(vi,Vi), by the cutline algorithm. The £i themselves can then 
be written as trigonometric functions of n/x. Trigonometric identities finish the proof, since E[P^) is the 
weighted average of the aj. 

We note that the formulas for regular polygons each converge quickly to 4/7r, and track close together. 
Writing E4^^(x) for a function whose values agree with E{Px) when x G 4N, and likewise i?4N+2(a;); we have: 

4 167r^ 4 ITtt^ tt^ 

i;4N(a;) - - ^ n ^ 90x^' Em{x)~ E4N+2{x) ^ 

Using these proposition above and a rational approximation argument, we observe a range of sprawls that 
can be achieved in Z^. 



1.25 




1.35 




Figure 3. Range of sprawls known for 1? . 

Corollary 12. A dense subset of the interval |] is contained in the set {E{1? , S) : gensets S}. 

Proof. There is a continuous path Lt through the space of perimeters that starts with the circle and ends 
with the square. The sprawl passes through all values from A/tt to 4/3 along the path. 

Any such perimeter Lt can be approximated arbitrarily closely by a rational polygon, which can be 
rescaled to an integer polygon without changing E. The sprawl of a polygon is continuous in the coordinates 
of its vertices, and E of the approximants approaches E of the original body. Finally, the set of integer 
vertices can be completed to a generating set without changing E, since the sprawl only depends on the 
extreme vertices. □ 

4.3. Hexagons. Let H^^y be the hexagon with vertices Vi = (x, y), \i2 = (1, 1), V3 = (—1, 1), where x > 1, 
y > 0, and x-\-y <2. Thus Hi q is a square (realized as a degenerate hexagon) and if2,o is a linear transform 
of the regular hexagon, giving 

E{Hi^o) = -, E{H2fl) = — ■ 

Lemma 13 (Parametrizing hexagons). Every convex, centrally symmetric hexagon is equivalent by a linear 
transformation to some H^^y 




■yi 



Figure 4. Hexagon reduction. On top we have shown the choices of which pair of sides to 
map to horizontal; the middle figure has vi in the desired position. 



Proof. Take a hexagon with vertices Vi, V2, V3, — vi, — V2, — V3. We can always find a linear transformation 
sending V2 1—^ (1, 1) and V3 1-^ (—1, 1). This reduces the parameter space to {vi — {x,y) : x > 1, —1 < y < 1}. 
Also, without loss of generality, we have x + |y| < 2; otherwise, change the choice of V2,V3, as in Figure |4l 
Finally, up to reflection in one of the coordinate axes, we can assume y > 0. □ 

Applying the algorithm sketched above, we can compute the side-pair averages, and obtain the following 
formula for a hexagon parametrized as above. 
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E{H ) — ^^^^ ^ ''^^ ^ ^^^^ ~ + '''^^y - 2/^ + 4x + 5y + 1 

Thus we have reduced the task of bounding the sprawl of hexagons to a calculus exercise (which we omit) : 
verifying that in the domain defined by x > 1, y > 0, and x + y < 2, this quantity takes values between 
23/18 and 4/3. 

This establishes the following statement: 

Theorem 14 (Sprawls of hexagons and three-generator presentations). 

"23 4" 

{E{H): hexagons -,- 

Thus, fl < E{I?,S) < I whenever \S\ < 6. 

This provides evidence, taken together with the fast convergence for sprawls of regular polyhedra towards 
4/7r, for the following conjecture. 

Conjecture 15 (Sprawl Conjecture for d = 2). The circle and the square are the extreme cases for all 
perimeters in . That is, 

[4 4" 

{E(L) : perimeters L C = -, - 

[tt 3 

Further evidence is given in the next section, where the sphere and cube are shown to be sharp bounds 
asymptotically as d — ^ oo. 

5. Sprawl in d dimensions: The not-so-flatness of Z'' 

In higher dimensions, the computation of expected distance between two points becomes quite intuitive 
for the sphere and the cube. Suppose d is very large. For the round unit sphere Sphere^ C K'', which induces 
the Euclidean metric as its Minkowski norm, take one point to be at the north pole without loss of generality. 
Then concentration of measure phenomena ensure that the second point is almost surely on the equator, so 
the distance between them is nearly + 1^ = ^/2. On the other hand, the cube induces the sup metric. 
In this case, the distance computation is performed by sampling the random variable \xi — yi|, which ranges 
between and 2, a total of d times. For very large d, we should expect this supremum to tend to 2. This 
reasoning predicts that i?(Spherej;) — t- -\/2 and ^^(Cubed) — >■ 2; the former can be approximated and the 
latter can be exactly realized by a word metric. What about the group Z"^ with its standard generating set? 
In dimension 2, this is isometric to the cube metric, but that is no longer true for d > 2. In dimension 3, 
the limit shape for the standard word metric is an octahedron, and more generally in dimension d it is the 
join of d copies of 5°, called an orthoplex (or cross-polytope) . We will derive the answer below, finding that 
E{Z'^,5td) |. 

By way of interpretation, this says that a cubical generating set gives Z"^' more and more hyperbolic-like 
geometry as d gets large, while the standard word metric is bounded uniformly away (see Figure [S]). We are 
accustomed to describing the group Z'^ as "flat" because it is quasi-isometric to Euclidean space. However, 
using this statistic that gives a finer measure of large-scale curvature, we see that the standard generators 
give more of a hyperbolic character to the group, and that there exist generators for large d which make the 
geometry a good deal closer to hyperbolic than fiat. 

In the computations below, recall that for natural numbers n, the double factorial n!! denotes the product 
of all the natural numbers up to n that have the same parity: 

n!! = Y[ in-2i). 

i: 0<2i<n 

Double factorials will occur in the calculations, but they can be re-expressed in two cases: 

(2n + l)! 



(2n)!! = 2"-n! ; (2n + 1) 



II ^ 



2" • n! 

To get rates of approach, we use an approximation for n\ that goes one term beyond Stirling's formula: 

n! = ^/2^(f)"(l + ^-^0(;f,)). 
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5.1. The sphere. 

Proposition 16. The sphere induces the P metric on W^. The formula for the sprawl of the sphere is given 
in the following closed form: 



2'^-'^ T(-d) 



2 



^/? r(rf-i)- 
Thus, E{Spheve^) \/2 as d^ oo, with 

\/2 - ^(Sphered) ^ ^. 

Proof. Recall that, where denotes the surface area of S'^ (so that Ai = 2tt and A2 = 47r), there is a 
recursive formula given by A/- = Ak-i sin'^~^(^) d9. The distance between two points on the sphere that 
subtend an angle 6 at the origin is \/2 — 2cos^. Then we find that the Ak terms cancel out, giving 

^(SphereJ = 



/;sin'^-^(^)d^ 



which can be computed explicitly. 
Let 



PTT PIT 

a„= / \/2- 2 cos 61 sin" 61^61, 6„ = / sin"6'rf6', 

^0 Jo 
so that E(Sph.erej) = ad-2/bd-2- 

Integrating by parts gives 6„+2 = so since bo = tt and bi = 2, we get 6„ = c„ ^"~m^" , with c„ = tt 

if n is even and 2 if n is odd. 

Change of variables and integration by parts gives the recursion a„+i = 2n+3 ^^' Since uq = 4, this gives 

„ _ A (2»)!! 
~ ^(2n+l)!! • 

Combining and re-indexing wtih d = n + 2, we get 

, X ad-2 (2d -4)!! (d-2)!! 
i?(Sphere,) = ^ = e.j^j^jj^^ 

where is 4/7r if d is even, and 2 if d is odd. Re-expressing the double factorials completes the proof. Note 
that the use of the gamma function enables us to drop the dependence on parity of d because r(^) is an 
integer for whole numbers z but has ^/tF in the denominator for half-integers z. □ 

5.2. The cube. 

Proposition 17. The cube is the limit shape for Z"^ with a nonstandard generating set {±ei • • • ± Cd}, and 
it induces the metric on M''. The formula for the sprawl of the cube is given in the following closed form: 

£;(Cube.) = ^ 

Thus, S(Cubed) -^2 as d ^ 00, with 

2-£;(Cubed)~^. 

Proof. Let Xi and jji be independently distributed uniformly on the interval / = [—1, 1]. Wc will use these 
random variables to compute the sprawl for Cube^, which we identify with the {d — l)-complex in M.'^ with 
vertices (±1, . . . , ±1). To fix notation: Cubei is a pair of points on the line and Cube2 is a square in the 
plane. Cube^ has 2d top-dimensional facets, each a copy of Note that each facet is the locus of points 

satisfying a^j = c for c = ±1. It has exactly one opposite face {xi = — c), and all the others are adjacent since 
the defining equations can be simultaneously satisfied. For a point in R** to be in Cube^, all coordinates 
must be in /, and at least one of its coordinates must be ±1. 
We compute 

P{\xi-yi\<r) = ^^; P{\1 - < r = - 
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by considering the uniform measure on the square and caleulating the portion of the area between the 
lines X — y = r and y — x = r m. the first case, and above the hne y = r m. the second. From this we get 
cumulative distribution functions 

(4r - r2)<^-i 

-Fsame('') = -P(rfoo(x, y) < r : X, y On same face) = -^-^ ; 

(4r - r'^Y~'^ /r\2 

Fadii^) = -P(rfoo(x,y) < r : x,y on adjacent faces) = (^-J . 

To find expectations, we integrate j^rF'^r) dr. 

The fi-cube has 2d faces, so if x is placed randomly, then the probability that y is on the same face or on 
the opposite face is l/2d in each case, while all of the other 2d — 2 faces are in the adjacent case. Recalling 
that the distance between any two points on opposite faces is 2, we get 

^ , 1 • 2 + 1 • rF;,^e(0 dr) + {2d - 2) ■ rF'^.jr) dr 
E{Guhe,) = . 

Prom this and some algebraic manipulation we derive 



i;(Cubed) = - 



1 d-1 



d ¥^^d 



f r'^-^A-r^-'^ dr + {d-l) f r'^ {4 - rf-"^ dr + {2 - d) f r'^+Hfi - rY'"^ dr 
Jo Jo Jo 



Let's let = r™(4 — r)" dr. Integration by parts and some further manipulations will give recursive 
formulas, for instance 

_ 2 2n 
2n + l 

which simplifies to /„,„ = 2^"+^ (in+i)!! since /o,o = 2. 

The In+i,m In+2,ni and In+A,n Eire derived similarly, from which we find 

Re-expressing the double factorials completes the proof. □ 
5.3. The orthoplex. 

Proposition 18. The orthoplex is the limit shape for TJ^ with its standard generating set ±{6^}, and it 
induces the £^ metric on R"^. The formula for the sprawl of the orthoplex is given in the following closed 
form: 

Thus, £^(Orthd) — >■ | as d ^ 00, with 

^-i^(Orth,)^l. 

Proof. First note that by symmetry, the expectation of ||x— y||i is equal to d times the expectation of |a;i — 
Thus 



-B(Orthd) = d \xi-yi\ d^{xi) dfi{yi), 

where dfi is the measure induced by on a single coordinate axis of M''. That measure is given by 

(l-|xi|)'^-^ ^ 
Mxi) = — 1^^_2)\ — 

as can be verified by considering how much volume the orthoplex has at height xi. We can renormalize to 
a probability measure by taking u = ^'^"^^^' /x, so that /orthd^ di'^ = 1- Thus we are calculating 

EiOvtha) =df = til! /■ \x-y\.{l-\x\f-^{l-\y\f-^ dxdy. 



72 
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But again by symmetry, this is just 



2d{d-lf [ x{l-xf-^ [ {1-yf-^dydx 



Evaluating in y and then performing hght manipulation gives us 

nl j-1 

'"^^^ dx- x{l - xf"^-^ dx 
Jo 



2d{d- 1) 
as desired. 



x{i-xy 







= 2d{d- 1) 



{d-l)d (2d- 2)(2d- 1) 



3d - 2 
2d - 1' 

□ 











( ) 


( ) 




, 









V2 



Figure 5. Ranges of sprawls: [i?(Spherej;), E{C\ihed)\ is shown for d = 2, 3, 4, 5, 100, oo. 



5.4. The range of sprawls and the Mahler conjecture. By rational approximation of convex bodies (as 
in the proof of Corollary [T^), we find that a dense subset of the interval [i?(Sphere£;), ^^(Cubeci)] is contained 
in the set of values realized by groups, so 



[£;(Sphererf),i;(Cubed)] C {EiZ'^^S) : gensets S}. 
We conclude by conjecturing that this is everything. 
Conjecture 19 (Sprawl Conjecture). The sphere and the cube are the extremes for the sprawl. That is, 

{E{L) : perimeters L C R''} = [E{Sphere^), E{Cuhed)] . 

This conjecture would complete the description for free abelian groups of the dependence of this curvature 
statistic on the generating set, showing the values to be "pinched" as in Figure [3 

A similar conjecture could be formulated for the balls instead of the spheres: consider the average distance 
statistic for convex, centrally symmetric J7 C K'^ defined by 

/o2 l|x- ylk2 dVoi^ 



E(r2) 



Here, it is known (by the Brascamp-Lieb-Luttinger inequality [TH Thm 1]) that E is minimized by (round) 
balls and ellipsoids, but the question of verifying that it is maximized by cubes is open. 

Some evidence for the Sprawl Conjecture can be found in the high-dimensional asymptotics. Because 
E < 2 always, it is immediate that 

lim sup{E{L)} = lim E{Guhed) = 2. 

d^oG d^oo 

Arias-de-Reyna, Ball, and Villa consider E(r2) and prove that for almost all pairs of points in x il, the 
distance is greater than ^/2{1 — e) [TJ Thm 1]. As they note, the points in the ball become concentrated in 
its boundary as d — cx). This shows that the i?(Spherej;) is a lower bound for sprawl asymptotically, i.e., 

lim mi{E{L)} = lim i;(Sphere^) = V2. 
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The Sprawl Conjecture resembles another well-studied problem in convex geometry. For a convex, centrally 
symmetric body fi, define its polar body by 

rj° := {x e R'' : X • y < 1 Vy G 17}. 

Thus for instance, the sphere is its own polar body in every dimension, — il, and (Orthd)" = Cubcd. 

The Mahler volume of ft is defined to be M{il) = Vol(i7) • Vol(f2°). Let us also say that for any set A ~ —A, 
we write M (A) for the Mahler volume of the convex hull of A. Then, just as for the sprawl, this is a statistic 
that is continuous in D, and invariant under linear transformations; it has been described as measuring the 
"roundness" of the convex body. Mahler conjectured in 1939 that the extremes in every dimension were 
realized by the sphere and the cube. Santalo proved in 1949 that the spheres did indeed realize the upper 
bound on Mahler volume, but the lower bound is still an open problem, despite some interesting recent 
progress by Kuperberg and others. 

Above, we have staked out the point of view that, like the Mahler volume and other affine isoperimetric 
invariants, sprawl is measuring a quality of roundness versus pointiness of the shape L. Inspecting the 
estimates for sprawls of regular polygons derived after Proposition [11] shows something surprising: there is 
no point after which sprawl decreases monotonically as the number of sides in the polygon increases. Thus, 
regular polygons with 4fc — 2 sides are a bit "rounder" than regular polygons with 4A; sides (for all fc > 4) , 
even though they have fewer sides. On the other hand, as measured by Mahler volume the roundness of 
regular polygons increases monotonically in the number of sides. 

Finally, we note that the average distance between two points on the round sphere is precisely equal to 
the constant 7„ that Kuperberg uses to state the inequality in his [15, Corollary 1.6], where it is described 
as "a monotonic factor that begins at i/n and converges to -\/2-" Recognizing the geometric meaning of this 
constant allows his result to be rephrased as 

M{n) > (f • EiSpheie^) ■ M(Cubed). 

The fact that this general inequality for Mahler volume should be so simply stated involving the sprawl is, 
we hope, intriguing. 
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